A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries

Authors

  • Wendy W. Chapman
  • Will Bridewell
  • Paul Hanbury
  • Gregory F. Cooper
  • Bruce G. Buchanan
Abstract

Narrative reports in medical records contain a wealth of information that may augment structured data for managing patient information and predicting trends in diseases. Pertinent negatives are evident in text but are not usually indexed in structured databases. The objective of the study reported here was to test a simple algorithm for determining whether a finding or disease mentioned within narrative medical reports is present or absent. We developed a simple regular expression algorithm called NegEx that implements several phrases indicating negation, filters out sentences containing phrases that falsely appear to be negation phrases, and limits the scope of the negation phrases. We compared NegEx against a baseline algorithm that has a limited set of negation phrases and a simpler notion of scope. In a test of 1235 findings and diseases in 1000 sentences taken from discharge summaries indexed by physicians, NegEx had a specificity of 94.5% (versus 85.3% for the baseline), a positive predictive value of 84.5% (versus 68.4% for the baseline) while maintaining a reasonable sensitivity of 77.8% (versus 88.3% for the baseline). We conclude that with little implementation effort a simple regular expression algorithm for determining whether a finding or disease is absent can identify a large portion of the pertinent negatives from discharge summaries.
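The three steps the abstract attributes to NegEx (match negation phrases, filter out phrases that only look like negations, and limit the scope of a negation) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the phrase lists, the five-token scope window, and the function name is_negated are assumptions made for illustration, and only negation phrases that precede the finding are handled.

```python
import re

# Illustrative phrase lists only (assumption): NOT the published NegEx lexicon.
PSEUDO_NEGATIONS = ["no increase", "no change", "not cause"]
NEGATION_PHRASES = ["no evidence of", "denies", "without", "not", "no"]


def is_negated(sentence: str, finding: str, window: int = 5) -> bool:
    """Return True if `finding` appears within `window` tokens after a
    negation phrase. The default window of 5 is an assumed scope limit."""
    text = sentence.lower()

    # Step 1: mask pseudo-negations so they cannot act as triggers.
    for phrase in PSEUDO_NEGATIONS:
        text = re.sub(r"\b" + re.escape(phrase) + r"\b", "PSEUDO", text)

    # Step 2: a negation phrase, then at most `window` intervening tokens
    # (the scope limit), then the finding itself.
    triggers = "|".join(re.escape(p) for p in NEGATION_PHRASES)
    target = re.escape(finding.lower())
    pattern = rf"\b(?:{triggers})\b(?:\s+\S+){{0,{window}}}?\s+{target}\b"
    return re.search(pattern, text) is not None


if __name__ == "__main__":
    print(is_negated("The patient denies chest pain.", "chest pain"))
    print(is_negated("No increase in edema was noted.", "edema"))
```

In this sketch the first call reports the finding as negated, while the second does not, because "no increase" is masked as a pseudo-negation before any trigger is searched for. A finding that lies more than five tokens after the trigger falls outside the assumed scope and is treated as present.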

Similar resources

Evaluation of negation phrases in narrative clinical reports

OBJECTIVE Automatically identifying findings or diseases described in clinical textual reports requires determining whether clinical observations are present or absent. We evaluate the use of negation phrases and the frequency of negation in free-text clinical reports. METHODS A simple negation algorithm was applied to ten types of clinical reports (n=42,160) dictated during July 2000. We cou...

Application of Evolutionary Algorithm to Optimization of ANNIS Model for Discharge Coefficient Circular Side Spillway Modeling

In this study, the discharge coefficient of the circular side orifices was predicted using a new hybrid method. Combinations made in this study were divided into two sections: 1) the combination of two algorithms including Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) and providing the PSOGA algorithm 2) using the PSOGA algorithm in order to optimize the Adaptive Neuro Fuzzy Infe...

Identifying Smoking Status From Implicit Information in Medical Discharge Summaries

Human annotators and natural language applications are able to identify smoking status from discharge summaries with high accuracy when explicit evidence regarding their smoking status is present in the summary. We explore the possibility of identifying the smoking status from discharge summaries when these smoking terms have been removed. We present results using a Naïve Bayes classifier on a ...

Detecting Adverse Drug Events in Discharge Summaries Using Variations on the Simple Bayes Model

Detection and prevention of adverse events and, in particular, adverse drug events (ADEs), is an important problem in health care today. We describe the implementation and evaluation of four variations on the simple Bayes model for identifying ADE-related discharge summaries. Our results show that these probabilistic techniques achieve an ROC curve area of up to 0.77 in correctly determining wh...

Journal title:
  • Journal of Biomedical Informatics

Volume 34, Issue 5

Pages: -

Publication date: 2001